Voice-based Age and Gender Recognition using Training Generative Sparse Model
نویسنده
چکیده مقاله:
Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to general signal classification scheme, our proposed algorithm includes train step to provide related atoms to each signal class and test phase to assess classification performance. Since the classification accuracy depends highly on the selected features, we employ Mel-frequency cepstral coefficients to train basis for better representation of speech structure. These bases are learned over data of male and female speakers using non-negative matrix factorization with sparsity constraint. Then, atom correction is carried out using an energy-based algorithm to decrease coherence between different categories of trained dictionaries. In sparse representation of each data class, atoms related to other sets with the highest energy are replaced with the lowest energy bases if reconstruction error does not exceed from a specified limit. The experimental results show that the proposed algorithm performs better than the earlier methods in this context especially in the presence of background noise.
منابع مشابه
Face Recognition in Thermal Images based on Sparse Classifier
Despite recent advances in face recognition systems, they suffer from serious problems because of the extensive types of changes in human face (changes like light, glasses, head tilt, different emotional modes). Each one of these factors can significantly reduce the face recognition accuracy. Several methods have been proposed by researchers to overcome these problems. Nonetheless, in recent ye...
متن کاملA Novel Text - Independent Voice based Automatic Gender Recognition System
Voice based gender and age classification can be helpful in a number of Information Technology based applications with speech interfaces. The recognition has to be independent of the text of the input speech if the application is online. In this work three different feature sets were tried for text independent gender recognition. The first set is Mel-Frequency Cepstral Coefficients (MFCC) C1 to...
متن کاملFace Recognition using an Affine Sparse Coding approach
Sparse coding is an unsupervised method which learns a set of over-complete bases to represent data such as image and video. Sparse coding has increasing attraction for image classification applications in recent years. But in the cases where we have some similar images from different classes, such as face recognition applications, different images may be classified into the same class, and hen...
متن کاملVoice Morphing Using the Generative Topographic Mapping
In this paper we address the problem of Voice Morphing. We attempt to transform the spectral characteristics of a source speakers speech signal so that the listener would believe that the speech was uttered by a target speaker. The voice morphing system transforms the spectral envelope as represented by a Linear Prediction model. The transformation is achieved by codebook mapping using the Gen...
متن کاملVoice Impersonation using Generative Adversarial Networks
Voice impersonation is not the same as voice transformation, although the latter is an essential element of it. In voice impersonation, the resultant voice must convincingly convey the impression of having been naturally produced by the target speaker, mimicking not only the pitch and other perceivable signal qualities, but also the style of the target speaker. In this paper, we propose a novel...
متن کاملFeature Extraction based Face Recognition, Gender and Age Classification
The face recognition system with large sets of training sets for personal identification normally attains good accuracy. In this paper, we proposed Feature Extraction based Face Recognition, Gender and Age Classification (FEBFRGAC) algorithm with only small training sets and it yields good results even with one image per person. This process involves three stages: Pre-processing, Feature Extrac...
متن کاملمنابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ذخیره در منابع من قبلا به منابع من ذحیره شده{@ msg_add @}
عنوان ژورنال
دوره 31 شماره 9
صفحات 1529- 1535
تاریخ انتشار 2018-09-01
با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.
میزبانی شده توسط پلتفرم ابری doprax.com
copyright © 2015-2023